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We study a homo-polymer model of a protein chain, where each monomer 
carries a dipole moment. To mimic the geometry of the peptidic bond, these 
dipoles are constrained to be locally perpendicular to the chain. The tenso- 
rial character of the dipolar interaction leads naturally to a (tensorial) liquid 
crystal-like order parameter. For non chiral chains, a mean field study of this 
model shows that a classical 9 collapse transition occurs first; at lower tem- 
perature, nematic order sets in. For chiral chains, an anisotropic (tensorial) 
collapse transition may occur before the 6 temperature is reached: the or- 
dered phase can be described as a "compact phase of secondary structures" , 
and possesses great similarities with the liquid crystal blue phases. 
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I. INTRODUCTION 



Proteins are chiral heteropolymers, made out of twenty species of monomers 
(aminoacids). Natural proteins have the property of folding into an (almost) unique compact 
native structure, which is of biological interest HQ]. The compactness of this unique native 
state is largely due to the existence of an optimal amount of hydrophobic aminoacid residues 
||, since these biological objects are usually designed to work in water. The existence of 
secondary structures M plays a predominant role in the existence of the unique native state. 
From a theoretical point of view, one may either emphasize the heterogeneous character of 
these biopolymers ||, or choose to focus on the more crystalline-like secondary structures 
(a-helices and /3-sheets). In the first approach, folding is identified to a freezing transition; 
in the second approach, that we follow in this paper, the folding transition is more like a gas- 
solid transition. Previous studies along the latter path made use of simplified lattice models 
0. We consider here a microscopic continuous model of a homopolymer chain, where each 
monomer carries a dipole moment. The rationale for this choice is clear if one interprets the 
chemist's hydrogen bonds as the interaction between electrostatic dipoles (the secondary 
structures approximately corresponding to the classic dipole equilibrium positions). The 
dipolar moment of a CONH is po ~ 3.6D, yielding a typical dipolar temperature (temper- 

2 

ature below which dipolar effects become relevant) of Tp = 47r ^° d3 where e r is the relative 
dielectric constant of the medium, and d is a typical distance between dipoles. The electro- 
statics of proteins in water is a somewhat controversial issue 0, so we will content ourselves 
with an estimate of To in the native state. It is then appropriate to take e r ~ 5 [0]. With 
d ~ 3.8 A Q, we find T D ~ 2200 ~ AQ0K ? go that bipolar effects may play a crucial role in 
the energetics of the folding (or unfolding) transition. The spatial range of the dipole-dipole 
interaction can be finite or infinite: its angular dependence (or tensorial character) is the 
crucial issue of this paper. To make a closer connexion with proteins, we assume that the 
dipole moments are constrained to be perpendicular to the chain. In this way, a monomer 
can be thought of as linking two successive C a carbon atoms, and its dipole moment rep- 
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resents the peptid CONH dipole. A finite range of the dipolar interaction may further 
describe the finite range character of the hydrogen bonding interactions. The Hamiltonian 
of the model then reads 

^=yE m - rj) + § EEf^(n, r 3 ) V ) (1) 

In equation ([1]), /3 — ^ is the inverse temperature, v is the excluded volume, r*j denotes the 
spatial position of monomer i (i = 1,2, ...N), and pi its dipole moment. If necessary, three 
body repulsive interactions may be introduced, to avoid collapse at infinite density. The 
(infinite range) dipolar tensor reads 

G a ~/(r, f) = -3,3 (Aery - 3w Q w 7 ) (2) 

with v a = ^fez^r an d ^ is a prefactor containing the dielectric constant of the medium. The 
dipolar interaction (|^) is cut-off at small distances (|r — r*\ < a) and may also be cut-off at 
large distances by an exponential prefactor. The partition function of the model ([]]) is given 
by: 

Z = fU *W« W« " nl - «) - Po) ^ ■ - ?,)) exp (-3H) (3) 

In equation (||), a denotes the Kuhn length of the monomers, and po is the magnitude of their 
dipole moment. The third 5-function restricts the dipole moment to be perpendicular to the 
chain; this mimics the free rotation of the CONH dipole around the C a — C a "virtual bond" . 
As it stands, this partition function is not easy to evaluate. In particular the constraint of 
fixed length dipoles is not easy to implement. We have therefore studied the constrained 
dipolar chain in (i) in the framework of the virial expansion, with fixed length dipoles (ii) in 
the framework of a Landau theory, where we replace the fixed dipole length by a Gaussian 
distributed length. 

II. THE VIRIAL APPROACH 

Since the virial extension is quite familiar in the field of polymeric liquid crystals M , we 
will be rather sketchy in our presentation. Starting from equations ([l|-|3|), we write 
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exp -P £ T,P? G *y(n, rj)p] = IK 1 + fa) 
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- i • Yl fu ■ ■■■ 
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where = f(fi — fj) — exp (—(3pi ■ Gy • p^) — 1. To lowest non trivial order in /, we get: 

2 = / n^^d^+i _ ~ a ) ex p (-^po«({^})) 

i 

x exp J dpidpjfij 5(1^1 - p ) 5(p { ■ (f i+1 - fi))5(\pj\ - p ) 5(pj ■ (r j+1 - fj)) \ (5) 
where f3TL po i = y J2i^j ^ifi ~ fj) ■ Introducing the joint distribution function: 

g(r,u) = ^2S(f-fi) 8(u-Ui) (6) 

i 

where Ui = (r i+ i — fi)/a, the partition function fl5|) reads: 

Z = j V 9 V 9 ex P U J drdu 99 ~ ^ J dr(J du g) 2 ^ 

x exp (- J drdr dudu g(r, u)V{r, ■u|r y , u)g(r > , u') + logC(g)^ (7) 

where 

V(r, u\r > , u') = J dpdp5(\p\ — po) 5(p ■ u)5(\p\ — po) 5{p ■ u)f{f — r 1 ) (8) 
and ( is the polymeric partition function given by: 

((g) = J Rdn 5(\f l+1 - n\ - a^-*^ (9) 

i 

In the spirit of mean field or variational approaches ||, we look for a homogeneous (space 
independent) distribution function: g(r,u) = p(f>(u), where p = ^ is the monomer concen- 
tration and the angular distribution function <ft is normalized to 1. The free energy per 
monomer reads: 

(3F = J du<f){u) log(0(w)) + ^P~\p J dudu'<j){u)V{u ■ u)<p(u) + 0{p 2 ) (10) 



where 

V(u ■ u') = J dfdpdff 5(\p\ — Po) 5(p- u)6(\j?\ — Po) 5(ff ■ u')f{f) (11) 

and the 0(p 2 ) term in equation ( |10D represents the three body repulsive interactions. Using 
the Onsager variational form || 
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cosh(acos#) (12) 
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where 6 is the polar angle of the unit vector u, we obtain 

Ot 2 7T 

(3F = log( ) + — — (arctan(e a ) - -) 

Aire tanh a sinh a A 



+ yP (^1 - ? - g^((l - 2^(«)) 2 + 2^(a) - -) j + C(p 2 ) (13) 

where t = jr, with Tg = |y^To, and u;(a) = ( co ^ ia — ^). We have also assumed that 
d ~ a, i.e. that an estimate of the distance between dipoles is given by the monomer length. 
Minimizing equation ( |13|) with respect to a and p yields the following results: 

• i) A usual second order #-like collapse transition to a finite density phase occurs at 
temperature t = 1. Since vq ~ a 3 , we have Tq ~ Tq. 

• ii) In the dense phase, a discontinuous nematic-like transition occurs at temperature 
£jv = jr- — \i with a small density discontinuity. Since T N is a variational estimate, 
the true transition temperature is necessarily higher. 

These results show that dipolar interactions are responsible for both transitions. We remind 
the reader that they are expected to be correct at small monomer concentration, since they 
rely on the virial expansion. At higher monomer concentration, one may find other phases 



(e.g. crystalline phases ||10|| ). An extension of this approach to chiral chains is possible 
||1 1|| . In this case, one expects to encounter a phase transition towards an inhomogeneous 
phase (e.g. cholesteric) at a temperature T H higher than T N . The possibility of a crossing 
between Tq and X# must therefore be considered. As shown in the next section, the Landau 
approach to the constrained dipolar chain provides a more convenient framework to study 
this possibility. 



III. LANDAU THEORY 



A. Formal developments 

At a qualitative level, experimental phase diagrams are usually in agreement with the 
Landau theory of phase transitions. Roughly speaking, this theory, which rests over the use 
of symmetry arguments, may be summarized as follows 

• (i) one looks for an order parameter Q{r) that describes the ordered phase(s) at point 



r. 



• (ii) one builds a free energy density as an expansion in Q(f), and in the first spatial 
derivatives of Q(r). 

• (iii) one minimizes the total free energy with respect to Q(r). 

This theory clearly assumes that the order parameter Q(r) varies on distances much larger 
than the microscopic length scales. Following this line, we first soften the constraint of fixed 
length dipoles in equation (||) and replace it by a Gaussian constraint (the replacement of 
fixed length by Gaussian spins is familiar in the theory of phase transitions). We therefore 
have 

f 1 -4r 

Zl= T\dfidpi 6(\f i+ i - fi\ - a) — — 2— e 2p o 5 fa ■ (f i+1 - r t )) exp (-/3H) (14) 

where the subscript L on the partition function stands for Landau and the Hamiltonian Ti 
is given by equation ([!]). Using the identity 
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5(y) = lim ( -- j e"^ (15) 



we may now perform the (Gaussian) integrals over the dipole moments pi in equation (|14|). 



As a result, the problem now depends only on the polymeric degrees of freedom. Introducing 
the tensorial (order) parameter Q ail {r) by 

Qan{r) = {{^i)a{Ui)n - <W) S(r- r { ) (16) 



where the notation vii = (r*+i — r«)/a was again used, we get 

z l = J II VQa^r) 5 \.Q aiJb {r) - _ <W) S(f-fi 

(a,u),r \ i 



J Y[dfi 5(\f i+1 - fi\ - a) exp(-f3H po i) exp (-^Tr log (17) 

where the matrix B is defined by its matrix elements 

B ai (f, 7) = S a7 8(f- 7) + Pp 2 Q a ,(r)G^(f, 7) (18) 

m 

Using the identity 

S (Qan ~ da^ = f V ® af * exp iQ a (Q afl - d a/1 ) (19) 

one may write 

Z l = J II ^Q afl (r)VQ a ^(r)exp(i J dr^Q a ^(r)Q afJL (r)) 

(a,n),r ' a<p, 

exp (-(3H pol (Q)) exp (-±Tr logB^ exp(logC(Q)) (20) 

with 

C(Q) = /il^ _ ^1 _ a ) ex P I _ ^ ^Qa^XO^aO^)** ~ <W) ) ( 21 ) 

i y a<A* i / 

and where we have expressed H po i as a function of Q (see equation (|23| ) below). So far, no 
approximations beside the assumption of a Gaussian distributed length of the dipoles were 
made. We stress the fact that the number of independent variables Q afJl (r) is equal to six, 
since Q is a symmetric rank 2 tensor. This is why the sum over the components (a, jj) in 
equations (p0| , ^TJ) is restricted to a < 



B. The physical order parameters 

At this stage, it may be useful to establish the relation of the previous section with the 
virial approach. It is clear that the tensor (or matrix) Q(f) can also be expressed through 
the joint distribution function of equation (El) as 



Qan{r) = du g(r,u) ((u) a (u)^ - S afl ) (22) 



Furthermore, one may build from Q(r) a (scalar) density 



p(r) = -~ Tr Q(f) (23) 



and a traceless tensor 



q(f) = Q(f) - ^Tr Q(f) (24) 

where I is the unit tensor. Assuming that both quantities are space independent, we easily 
recover the results of section ||. For space dependent order parameters, the Landau- Ginzburg 
free energy density Thif) is defined by: 

Z L = J Vp{r)Vq a „{r)e-PS d?:F ^ (25) 

It can be evaluated by 

• (i) performing a saddle point method on Q in equation ( p0| ) and replacing Q as a 
function of Q. 

• (ii) expanding the resulting free energy density in the exponent of ( P0| ) in powers of Q 
and performing a local gradient expansion. Note that all terms of the expansion could 
have been obtained from symmetry considerations. 

• (iii) expressing the free energy density in terms of p(r) and q(r). 
The resulting Landau free energy density reads: 

^(r) = |(Vp(f)) 2 + |(T - T e ) P \f) + |p 3 (f) + ■ ■ ■ 

+ ^-d a q a ^(r)d a/ q a ^(r) + y9 7 g QAt 9 7 g a ^ + y (T - T )Tr q 2 (f) 
-|Trq 3 (r) + |(Trq 2 (f)) 2 + ... 
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+ ^P{r)d 2 n a ,{r) - ^p(r)Tr q 2 (f) + • • • (26) 
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where we use the standard summation over repeated indices. The various coefficients a, 5, c 
in (E?j) can in principle be calculated using the method described above. In particular, 
the temperature Tg is the same as that defined in section [TT] and To would be the nematic 
transition temperature if there was no cubic term in q in equation (|26|). 



C. Including chirality 

The observation that the backbone of proteins has chirality can be appreciated through 
the Ramachandran plot fl| . The chirality of the C a "virtual chain" is predominantly associ- 
ated with the (f) dihedral angles. It is not easy to take chirality into account at a microscopic 



level ||12|| , while it is so at a Landau-like level ||13|| , since then, only very general symmetry 
considerations matter. In particular, if the system is described by a tensor q(r), the ab- 
sence of inversion symmetry ensures that a new (scalar) invariant shows up in equation (f26[), 
namely 

Fchiraiir) = D ea^qasd^qus (27) 

where D is a measure of the strength of the chirality and e aiiV the antisymmetric third rank 
tensor. The full Landau free energy density jF|°*(r) of the constrained dipolar chiral chain 
then reads 

H 0t (?) = FLif) + Fehiralif) (28) 

where T^if) and Fchiraiij) are given respectively in equations ( p6[) and (f27|). It is clear that 
equation (|28|) describes the coupling between a usual 9 collapse described by p(f) and blue 
phase- like ordering |L4| described by q(f). For our purposes, it is important to note that 
equation ([Z7]) implies that a modulated order in q(r) sets in at a temperature T x which is 
higher |I1] , |T5|| than the temperature Tjy of the uniform (nematic) order; on the other hand 
equation ( p6|) shows that, to lowest order, the 6 collapse occurs at the same D-independent 
temperature Tg. Before exhibiting the crossing of the two temperatures T x and Tg in a 
specific case, we briefly summarize the existing state of the art for blue phases. 



D. Summary of the blue phases folklore 



In chiral systems, the blue phases (BP) may show up in a narrow temperature range 
(< IK) between the simple cholesteric (helical) liquid cristal phase and the isotropic liquid. 
Their theory is quite complicated |T^JT5[| ; their ordering is described by a symmetric rank 2 
tensor which can, in some appropriate limits, be linked to the molecular orientations. So far 
two cristalline phases (BP I, BP II) and one non-cristalline phase (BP III) have been found. 
More complicated phases may exist when an electric field is present. 

The cristalline phases are characterized by a body centered (BP I) or simple cubic (BP 
II) lattice. The unit cell contains of the order of 10 7 molecules, so that the Landau ex- 
pansion may be a priori trustworthy. Note that these cristalline structures are threaded by 
disclinations (line defects of the molecular orientations). 

The non cristalline phase (BP III) is of particular interest ||16|| . It has the same macro- 
scopic symmetry as the isotropic liquid, and the (BP III)- (isotropic liquid) phase transition 
has a critical end point, much like the liquid-gas transition. Furthermore, attempts have 
been made to describe this phase as resulting from an unstable localized mode (see the 
references in [0), as opposed to an extended mode for the cristalline BP's. 



IV. A SIMPLE EXAMPLE OF ANISOTROPIC COLLAPSE 

We will exhibit, in a simple way, the crossing between the 9 (uniform) collapse tempera- 
ture Tg and the temperature T x below which a modulated order sets in. Following references 



15| . [l7|| , we consider in equation (p8|) the appearance of helicoidal order in the tensor q(r). 
At this point, it is perhaps useful to recall the reader that the various "helices" of this 
section are not the a-helices present in proteins. Indeed, these "helices" are linked to the 



order parameter of equation (|24|), and, as in all polymeric problems, it is difficult to deduce 
the chain configuration(s) from the physical order parameter (s). The q(r) helicoidal order 
is described by: 
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Qan(z) = —fc qsmcj) M afl + —7= gcos0 N afl (z) 



where the matrices M and N(z) are given by 

/l \ 



(29) 



M 



1 
\0 -2/ 



(30) 



and 



/ cos koz sin k z \ 
N= sin k z — cos k z (31) 

V 00; 

where ko = §-e z is the wave vector of the modulation. Usual cholesteric order (i.e. the 
"uniaxial" helix) corresponds to <p — f , while the purely biaxial helix is described by <fi = 0. 
We insert this trial order parameter in equation (|28| ) and get: 



f (V^r-)) 2 + |(T - To) p\r) + |p 3 (f) + 



+ (6 2 (T-T )- Cl p{r)) 



sin , 



g 3 g 4 
¥ + 64 24 + 



jT)2 

^ 1 9 

2% 5 



(32) 



This free energy density is to be minimized with respect to q, <fi and p(f) . Of particular 
importance are the terms which couple the order parameters in (p8|) . With our choice of the 
order parameter q afJ- (z), the first non zero term corresponds to the C\ coefficient of equation 
(Bp). Moreover, on physical grounds, this coefficient must be positive, since an increase of the 
density is expected to favor the appearance of a liquid crystal type of ordering. Furthermore, 
the order in p(r) can be shown to be uniform (k = ordering). Due to the large number of 
coefficients in the free energy density, it is appropriate to work with dimensionless quantities, 



and we shall follow the notations of reference Hlg] . The rescaled total free energy per unit 
volume / reads: 



/ = ±(t-ac)Q 2 ~\k 2 Q 2 cos 2 4 
-Q 3 sin 30 + g 4 + ^'c 2 + c 3 



(33) 
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with 



4 63 n 2 63/3 ( 'iA \ 

q = 7th Q > P= 3^p c ' (34) 

t = ^(T-T Q ), t = y(T-T e ), (35) 



3 a 2 b A b\ 

^*WW' s= isb^ (36) 



, 7 / ,2/3 1/3 

1 o 1 2 °4°0 °3 a 3 lon x 

K = "ftp Cl = 12 a ' ( > 



and 



*?=(f)'j}/. (38) 
We just quote here the conclusions of a study of equation (|33|) . Technical details will appear 



elsewhere [ 18| . The phase diagram can be studied in the (temperature, chirality) plane. For 



high enough values of the chirality parameter k, 



" 2 [ l ~ 32 7( ^-1r 9 -r„)) J > 9 (39 » 

we get a continuous transition towards a biaxial helix at a reduced temperature t x = k 2 
corresponding to a real temperature 

T x = T + 5k 2 (40) 

Below the transition, the reduced density grows as c ~ Q 2 . The low temperature phase can 
be viewed as a compact phase with helical order. From a protein-oriented point of view, 
this would correspond to a direct transition from a coil state (denatured) to a native state 
(i.e. compact with secondary structures). 

At much lower chirality, when T x < Tg, the scenario is more like the one in section 
[IT]: there is first a 9 collapse transition at t' e = 0, followed at a lower temperature by a 
discontinuous transition towards a "uniaxial" helix. 

At intermediate chiralities, several phase diagrams are possible, depending on the relative 
values of the parameters. In particular, the location of the phase transition between the 
compact modulated phases (biaxial and uniaxial) is very model dependent. 

In figure 1, we display a plausible phase diagram. 

12 



V. CONCLUSION 



Using the fact that dipolar interactions imply the existence of a tensorial order param- 
eter (see equation (P])), we have shown that chiral chains may have isotropic (0-like) or 
anisotropic (blue phase -like) collapse transitions. This result was found in a simple model 
with helicoidal order. The connexion with blue phases is perhaps not very surprising since 
the molecules of some of these compounds have dipoles perpendicular to the molecular axis. 
It is clear that the same questions that arise in the study of blue phases should arise, mutatis 
mutandis, in the study of chiral dipolar chains: 

• (i) For very chiral systems, is the continuous transition towards the compact biaxial 
phase preempted by a discontinuous transition to a compact phase with a (hexagonal, 
body-centered cubic,...) superposition of biaxial helices? 

• (ii) Can one find a description of low chirality systems in terms of double twist cylin- 
ders? 

• (iii) Can one link the order in q(f), the chain configuration(s) and the dipole con- 
figuration^)? Does a cristalline order in q(r) imply the existence of defects in the 
orientation of the chain ? 

The application of the previous results to (bio)polymers can be roughly summarized as 
follows. Since we are concerned with collapsed phases with uniform density, we are typically 
in a melt situation. The study of a single chain (such as a protein) is presently out of reach. 
In both cases, a more detailed analysis requires a better experimental knowledge of the 
various parameters (chirality,...). Nevertheless, we think that our approach is useful for the 
protein folding problem. In particular ||18|| , one may look for a double twist interpretation 
of a-helices and /3-sheets, the possible existence of localized instabilities (such as the ones in 
BP III [|19|) and their implications for the folding process. Along these lines, it is reassuring 
(although not completely unexpected) to note that the secondary structures of real proteins 
are well characterized by a q(r) tensorial order parameter [2(J. 
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Figure caption: Schematic phase diagram of the model in the (temperature, chirality) 
plane. The phases are labelled by C (coil), IG (isotropic globule), AGi and AG 2 (anisotropic 
globules). The C — IG transition, and the C — AG2 transition (for large enough chirality), 
are continuous. 
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